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ON THE EXISTENCE OF IDENTIFIABLE 
REPARAMETRIZATIONS FOR LINEAR COMPARTMENT 

MODELS 


JASMIJN A. BAAIJENS AND JAN DRAISMA 

Abstract. The parameters of a linear compartment model are usually estim¬ 
ated from experimental input-output data. A problem arises when infinitely 
many parameter values can yield the same result; such a model is called uniden¬ 
tifiable. In this case, one can search for an identifiable reparametrization of 
the model: a map which reduces the number of parameters, such that the 
reduced model is identifiable. We study a specific class of models which are 
known to be unidentifiable. Using algebraic geometry and graph theory, we 
translate a criterion given by Meshkat and Sullivant for the existence of an 
identifiable scaling reparametrization to a new criterion based on the rank of 
a weighted adjacency matrix of a certain bipartite graph. This allows us to 
derive several new constructions to obtain graphs with an identifiable scaling 
reparametrization. Using these constructions, a large subclass of such graphs 
is obtained. Finally, we present a procedure of subdividing or deleting edges 
to ensure that a model has an identifiable scaling reparametrization. 


1. Introduction 

Linear compartment models are used to describe the transport of material between 
different compartments of a system and appear widely in the fields of systems bio¬ 
logy and pharmacokinetics. These models can be given by a directed graph, where 
the edges represent the transport of material from one compartment to another. 
The rate of flow from i to j is assumed to be time-invariant and linear in the amount 
of material in compartment i. We will study identifiability of a particular class of 
models and conditions for the existence of identifiable scaling reparametrizations, 
following and extending the ideas of Meshkat and Sullivant [MS 14) . 

1.1. Problem description. The parameters corresponding to a linear compart¬ 
ment model are often unknown and are therefore estimated from experimental data. 
An important step in the modeling process is to check before experimenting whether 
several or even infinitely many parameter sets could yield the same data. If this 
is the case, it is impossible to tell which parameter values are correct, hence the 
parameter estimates could lead to wrong predictions. 

We assume that the experimental data consists of input-output values: the input 
corresponding to the amount of material that was added to the system in certain 
input compartments, and the output corresponding to the amount or concentration 
of material measured in the output compartments. Roughly, a model is called 
identifiable if we can recover the parameter values from the (noiseless) input-output 
data of this experiment; we refer to Section|2]for the precise notion in our context. 
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If there is a finite number of possible parameter values corresponding to given input- 
output data, then we can indeed recover the parameter values, at least locally. 

A compartment model can be described by a directed simple graph G = {V, E), 
i.e. a directed graph without loops or multiple edges. Throughout this paper, 
a graph G is assumed to be directed unless stated otherwise, with n = \V\ the 
number of vertices in G and m = \E\ the number of edges in G. Let [fc] denote the 
set {1,..., fc} for given k € N. We associate to G the nxn parameter matrix A{G) 
defined by 



( 1 ) 


where the (i,j G [n], i ^ j) are independent real parameters representing the rate 
of transfer from compartment j to compartment i. Possible outflow of material to 
the exterior is taken into account: each compartment is allowed to have a leak, given 
by aoi , which represents the rate of transfer of material from compartment i to some 
compartment outside the system (the environment). The diagonal entries of A{G) 
are defined as an = —aoi — negative total flow out of compartment i. 

Observe that the parameter matrix A{G) uniquely determines G and vice versa. 

The parameter space of a compartment model given by G consists of all matrices 
of the form A{G). This space will be denoted by ©g C to emphasize that the 

parameter space depends on the graph G. The elements of ©g are nxn matrices 
which have zeros on positions {i,j) with i ^ j such that _) —>■ i is not an edge in 
G. In particular, the elements of ©g have n + m nonzero positions which we can 
choose freely. 

Remark 1.1. Almost every statement in this paper involves a matrix A corres¬ 
ponding to the given graph G: either A = A{G) or A G ©g- When we write 
A = A{G), we mean the symbolic matrix defined in equation ([T|). On the other 
hand, by A G ©g we mean a matrix with the zero pattern of A(G) and parameter 
values substituted for the symbolic entries , i.e. an element of 

A linear compartment model described by a graph G gives rise to a system of 
linear differential equations. Let x G M" be the state variable representing the 
concentration of material in each compartment, let u G R” be the input vector 
corresponding to the input data of the experiment, and let y G M" be the output 
vector representing the measurement data. Furthermore, let A = A(G) be the 
parameter matrix corresponding to G, and let B G be a matrix that indicates 

from which compartments the output is obtained. Then the transport of material 
through the compartments can be described by a parametrized system: 


x(t) = Ax{t) + u{t) 
y{t) = Exit). 


( 2 ) 


Note that the matrices A and B do not depend on the time t, since we assume the 
model to be time-invariant. 

As in [MS14| . we only consider a specific class of linear compartment models, 
namely the models that satisfy the following three assumptions: 


Assumption 1.2. The input and output take place only in compartment 1. 
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(a) Linear compartment model 


(b) Graph representation 


Figure 1. General 2-compartment model 

This implies that the input vector is of the form u = (mi, 0, ... ,0)^ G R” and 
that the output vector y is of the form (a;i,0,... ,0)^. Therefore, system ([2]) can 
be simplified to system ©• 

x{t) = Ax{t) + uii:) 
y{t) = xi{t). 

The output y is no longer a vector in R", but just a value in R. Because of 
this assumption, we do not need to indicate the input and output in the graph 
representation of a given model. 

Assumption 1.3. The graph G is strongly connected. 

In other words, there is a directed path from any vertex in G to any other vertex 
in G. A path will be denoted by a sequence of vertices: the sequence {vo,vi, ..., Vk) 
represents the path from vq to Vk using the edges vq —)• ui, ui —>• U 2 , • ■ ■, Vk-i —>■ Vk- 

Assumption 1.4. Every compartment has a leak, and the leak parameters aoi,i € 
[n] are independent from each other and from the edge parameters Oij with i,j G 

[n], 

This assumption ensures that all parameters a^- for i,j G [n], including the 
diagonal ones defined as above, are independent. As a consequence, the dimension 
of the parameter space 0g equals m+n. A leak at compartment i would correspond 
to an edge from i to the environment, but these edges are not included in G. 

Remark 1.5. In a biological setting, the parameters Oij with i ^ j must be 
nonnegative, or they would correspond to a negative flow. Combining this with 
the assumption that every compartment has a leak, it follows that the parameters 
an, defined as —aoi — must be strictly negative. These constraints are 

not accounted for in our identifiability analysis, but they may help to recover the 
correct parameters when a model is only locally identifiable or even unidentifiable. 

Example 1.6. Consider the general 2-compartment model and its graph repres¬ 
entation in Figure [TJ This model can be described by the following ODE system: 

xi{t) = -(aoi + a 2 i)a:i(t) -I- 012 X 2 (t) + u{t) 

X 2 {t) = 021X1 (t) -b -(012 + O 02 )x 2 (t) 
ylt) =xi{t). 

Define on = —oqi — 021 and 022 = —012 — 002 , then the equations for xi,X 2 can 
be written as 


x'l it) 


Oil O 12 

Xi(<) 

1 

Ul{t) 

_X2(t)_ 


021 022_ 

_X2 (t)_ 


0 
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which bring the system of equations in the form of system ([3]). 

When considering model identifiability, a problem that arises is what to do with 
unidentifiable systems. As we shall see in Section [51 a model satisfying Assump¬ 
tions 11.2111.41 cannot be identifiable unless n = 1. We follow the approach of |MS14] 
by searching for identifiable combinations of parameters and using these to find 
identifiable scaling reparametrizations of the original model. An identifiable repa- 
rametrization is a map which transforms the model into a model whose parameter 
space is lower dimensional, such that it is identifiable. We restrict ourselves to ra¬ 
tional scaling reparametrizations; these correspond to a rational scaling of the state 
variables, as described in Section The advantage of a scaling reparametrization 
is that it has a relatively simple connection to the original model. Although the 
parameters of the reparametrized model do not allow us to estimate the original 
parameter values, they correspond to certain combinations of the original paramet¬ 
ers and hence we can predict relative size changes of these parameters. Our primary 
goal is to classify models for which there exists an identifiable scaling reparametri¬ 
zation. As we will see, already under the restrictive assumptions above this task 
requires new combinations of mathematical techniques. 

A related question is how to construct, from a model which has an identifiable 
scaling reparametrization, larger models that also admit such a reparametrization; 
or, alternatively, how to transform a model that does not admit an identifiable 
scaling reparametrization into one that does. To answer these questions, it is crucial 
to understand the structures in a graph that allow the corresponding model to 
have an identifiable scaling reparametrization. In Section |4| several combinatorial 
constructions and necessary or sufficient conditions are presented for the existence 
of an identifiable scaling reparametrization. And while our model assumptions are 
indeed quite restrictive, we will show that some of our results extend to a more 
general setting. 


1.2. Previous work. The conce pt of id entifiability of dynamical systems was in¬ 
troduced by Bellman and Astrom BA70 in 1970, and has been studied extensively 
since. Godfrey [God88) gives a thorough description of compartment models and 
applications, also treating the concept of identifiability. There have been different 
approaches to determining whether a system is identifiable or not, where one has 
to distinguish between local and global identifiability, as defined in |bA 70 . These 


methods i nclude a Taylor series expansion approach |Po h78], a Laplace transform 
approach BA70 , a similarity transformation approach [CG851 lYEGOQ) also known 
as exhaustive modeling, and a differential algebra approach [LG941 IMEDOQj . A 
graph theoretical approach to parameter identifiability is described in |BHS14] . A 
practical comparison of different algorithms for parameter identifiability analysis of 
biological systems is given in (RKS~*~14] . 

We consider the question of what to do with unidentifiable systems. In this case 
there are too many parameters (unknowns) compared to the amount of informa¬ 
tion obtained from the experiment, so the parameter space has to be constrained 
somehow. This can be done using a reparametrization of the original system, which 
reduces it to a model having a parameter space of lower dimension. A procedure 
for finding such a reparametrization has been discussed for the differential algebra 
approach |LG941 IMED091 IMADll) , for the Taylor series approach [EC00| , and for 
the similarity transformation approach [GG981IMTK15) . In [MTKI5I IYEG09| it 
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is observed that non-identifiability of a system of ODE’s is often due to Lie point 
symmetries. We would like to point out that this is not the entire story in our 
setting; see Remark 1531 As we only consider a specific class of models, the problem 
of finding identifiable reparametrizations becomes easier compared to the general 
setting discussed in these references. 

The motivation for this restrictive setting is that our goal is different from the 
results described above. We are interested in certain structures of graphs that cause 
the lack or allow the existence of an identifiable scaling reparametrization for the 
corresponding model. After all, it would be very useful to learn not just whether a 
given model has an identifiable scaling reparametrization, but also why it does or 
does not, and how the model can be adapted to obtain one that does admit such 
a reparameterization. Our aim, in a restricted setting, is therefore more ambitious 
than existing results for more general settings. We build upon [MS 14) . where the 
same class of models is analyzed and several results for the existence of identifiable 
reparametrizations are derived. Meshkat and Sullivant also present an algorithm 
to find such a reparametrization if one exists. We will discuss their main results 
before extending these ideas. 

1.3. Organization of this paper. Our main goal is to obtain a classification of 
graphs (satisfying our assumptions) for which there exists an identifiable scaling 
reparametrization. Section summarizes the definitions and results from [MS14j , 
which we will need for our analysis. This involves the concepts of identifiability 
and identifiable (scaling) reparametrizations, and a criterion for the existence of 
an identifiable scaling reparametrization for a given model. This criterion will be 
referred to as the dimension criterion. 

In Section [3] we derive a reformulation of the dimension criterion, which is the 
main result of this paper: 

Theorem 1.7. Let G = (V, E) he a graph satisfying Assumvtions \1.2}\1.4\ Then G 
has an identifiable scaling reparametrization if and only if the matrix B{G) defined 
by 

if i = k, j ^ I and I ^ j £ E 
if i ^ k, j = I and i ^ k £ E 
if i = k and j = I 
otherwise. 

has full column rank. 

Next, in Section 2] we search for constructions that can be applied to a given 
graph, such that the resulting graph has an identifiable scaling reparametrization. 
We briefly discuss some results of |MS14) . followed by several new constructions. 
The concept of an ear decomposition of a graph is discussed, and we prove the 
following theorem: 

Theorem 1.8. Let G be a graph that has a nontrivial ear decomposition, i.e. an 
ear decomposition without trivial ears whose initial cycle contains vertex 1, then G 
has an identifiable scaling reparametrization. 

Interestingly, this result also holds for models with multiple inputs or outputs 
besides compartment 1, thus relaxing Assumption 11.21 and Assumption 11.41 can 
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even be completely removed. In the remainder of Section |4] we present some com¬ 
putational results (obtained using Mathematica) on the size of certain classes of 
graphs for which an identifiable scaling reparametrization exists. 

Finally, in Section [5] we summarize our results and we present some directions 
for future research. 


2. Identifiable scaling reparametrizations 

In this section we give a brief overview of definitions and results from [MS 14] . In 
order to relate the observed data to the unknown model parameters, one constructs 
an input-output equation |BEC13| : 

il){y,u,A) = 0 . 

This equation depends only on the parameter matrix A, input u, and output y, but 
it may also contain derivatives y,y,... ,u,u,.... 

Given a matrix M, let Mi denote the submatrix obtained by deleting the first 
row and column of M. Recall that according to Assumption 11.21 compartment 1 is 
the input-output compartment. For linear ODE models satisfying our assumptions, 
we obtain the following input-output equation: 

Theorem 2.1 1 [MS 141 Thm. 2.2]). Let G be a graph satisfying Assum,vtions \ 1. S\\l.4\ 
and let A = A{G). Then the input-output equation corresponding to ([3]) becomes 

(4) + ... + Cny = A -f ... -f dn-iui 

where ci,..., c„ and di, ..., dn-i are the coefficients of the characteristic polyno¬ 
mial of A and Ai , respectively. 

The input-output equation gives rise to a coefficient map c that maps a parameter 
matrix A € Qq to the coefficient vector of the input-output equation. This vector 
contains the coefficients of y, y, y,..., u, u, it,... in terms of the parameter values 
Oij. Erom the above equation we obtain the coefficient map c : ©g —S’ given 

by 

c(.A) .— (ci,..., Cji , d\ ,..., dn—\) , 

where ci,..., c„, di,..., dn-i are the coefficients of equation 0. This map is called 
the double characteristic polynomial map. Because of Theorem 12.11 the matrices A 
and Ai and the double characteristic polynomial map c will play a major role in 
the rest of this paper. 

Using c we can define identifiability in mathematical terms. Suppose two distinct 
parameter matrices A, A' yield the input-output data, i.e. c{A) = c{A'). Then it 
is impossible to tell from only observing the relations among input and output 
whether the parameter values corresponding to the model should be those of A or 
of A' . The model is called globally identifiable if this does not happen, i.e., if c is 
injective on the parameter space. This is typically too strong a condition; as in 
[MS14| . we will reserve the predicate identifiable for the following notion, which is 
also called generically locally identifiable: if a map / from a parameter space 0 
captures the observable quantities of a model, then the model is called identifiable 
if there is a dense subset U of 0 such that each A € U has a neighborhood on 
which / is injective. 
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However, taking 0 = 0o and f = c, Meshkat and Sullivant show that the result¬ 
ing model is not identifiable unless n = 1. One approach when dealing with uniden¬ 
tifiable models is to restrict the parameter space to a lower-dimensional space. An¬ 
other approach, taken here and in [MS 14) , is to look for combinations of parameters 
that are identifiable. Motivated by the origin of the model corresponding to G, they 
restrict to combinations that take the form 

bij = CLijfi{A)/fj{A) 

where fi : Qq M, z = l,...,n are functions. Then the bij have a natural 
interpretation as the rates of the dynamical system with scaled variables 

Ai = MA)xi. 

To achieve that Xi equals the quantity in the input-output compartment 1, we 
further impose that fi{A) = 1 for all A. Together, the functions fi, i & [n] are 
called a scaling reparameterization. Since the matrix B = {bij)ij is obtained from 
A by conjugating with a diagonal matrix, we have c(B) = c{A) and bu = an for all 
z G [n]. Thus c induces an ^-valued function c on the image 0' of the map that 
sends Ato B. If c : 0' is identifiable, then the scaling reparameterization 

is called identifiable. If an identifiable scaling reparametrization exists, then it can 
always be chosen such that the fi are in fact monomials in the entries of A [MS 14) . 

Lemma 2.2 1 [MS 141 Cor. 2.13]). If G has an identifiable scaling reparametrization, 
then G has at most 2n — 2 edges. 

We focus on determining whether or not an identifiable scaling reparametrization 
exists for models satisfying our assumptions. Because of Lemma |2.21 we only need 
to consider graphs which have at most 2n — 2 edges, so from now on we assume 
m < 2n — 2. The main result of Meshkat and Sullivant gives a criterion to decide 
if an identifiable scaling reparametrization exists: 

Theorem 2.3 1 [MS141 Thm. 1.2]). A graph G has an identifiable scaling repa¬ 
rametrization if and only if the dimension of the image of the double characteristic 
polynomial map is m -\- 1. 

We will refer to Theorem 12.31 as the dimension criterion. This criterion reduces 
the problem of deciding whether or not an identifiable scaling reparametrization 
exists to calculating the dimension of the image of the double characteristic poly¬ 
nomial map. We observe that the dimension criterion allows us to check whether G 
has an identifiable scaling reparametrization by calculating the rank of the differ¬ 
ential (or Jacobian) of the map c at a sufficiently general point A G 0^. This 
follows from the fact c is a polynomial map, surjective on imc, hence for A in an 
open dense subset of 0 g the rank of dac equals the dimension of the image of c 
[Har921 Prop. 14.4]. 

Definition 2.4. We say that G has the expected dimension if the dimension of the 
image of the double characteristic polynomial map equals m -|- 1. 

In other words, an identifiable scaling reparametrization exists if and only if G 
has the expected dimension. If this is the case, the scaling reparametrization can 
be found using the algorithm presented in [MS14] . 
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3. Reformulating the dimension criterion 


So far, we have discussed all relevant definitions and earlier results. From now on 
we take a new approach, starting with a reformulation of the dimension criterion 
that was given in Theorem 12.31 This leads to our main result: an alternative 
criterion to test whether a given graph has the expected dimension. In other words, 
we determine if an identifiable scaling reparametrization exists. This criterion can 
be verified in probabilistic polynomial time. 

So far we have been working over the real numbers, since all parameters are 
assumed to be real. However, R lies Zariski dense in C and the dimension of the 
image of the double characteristic polynomial map c is determined by the rank of 
its Jacobian at a sufficiently general point. Therefore we might as well work over 
the complex numbers to determine the dimension of the image of c. From now 
on let 0G C and let c : ©g —t Working over the complex numbers 

simplifies issues concerning diagonalizability, which we shall be using later on. 

We consider the matrix group GL„(C), the general linear group, consisting of all 
invertible n x n matrices over C. For simplicity GL„ is written instead of GL„(C). 
The tangent space of GL„ at the identity is its Lie algebra g[„. This space consists 
of all n X n complex matrices, with the commutator serving as the Lie bracket: 
\X, A\ := XA — AX. We write g[„ instead of to emphasize that it arises as 
the tangent space of GL„. 

Furthermore, given a matrix A S C”^", the centralizer of A in g[„ is denoted 
(H). It contains all X G qI„ that commute with A, i.e. \X, H] = 0. 

3.1. The kernel of the differential map. For sufficiently general A G ©g we 
have the following chain of equalities: 


dim im c = rk(dyic) = m + n — dim ker(dAc) 


where cIac denotes the differential (or Jacobian) of c at the point A. The first 
equality was already mentioned in the previous section and the second equality 
follows directly from the rank-nullity theorem. Thus we have shown the following 
lemma. 

Lemma 3.1. The dimension of the image of the double characteristic polynomial 
map c eguals m + 1 if and only if the dimension of the kernel of the differential dAC 
equals n — 1 for sufficiently general A G ©g- 

Using this result, we can determine whether a given model has the expected 
dimension by calculating the rank of the differential d^c. In order to classify which 
models have the expected dimension, we need to know what the kernel of d^c looks 
like. By definition of the double characteristic polynomial map c, the kernel of d^c 
is equal to the intersection of the two kernels corresponding to the differentials of 
the characteristic polynomials of A and Ai. Using this observation we will derive 
the form of ker(dAc). 

Proposition 3.2. For sufficiently general A G Q, the kernel of the differential map 
dAC : ©G —t is given by 




3Xeg[„: [X,H] 

3reg[„_i: [F,Ai] 


where Ai,Ci denote the matrices obtained by removing the first row and the first 
column from A, C, respectively. 



EXISTENCE OF REPARAMETRIZATIONS FOR LINEAR COMPARTMENT MODELS 


9 


Remark 3.3. Suppose that X is an u x n-matrix with zeroes in the first row and 
column except on position (1,1) and that the linear map A i—>■ [X, A] maps 0 g into 
itself. Then taking T = Xi we hnd that [X, A] lies in the set in the proposition 
for each A ^ Qq. The set of such X form a Lie algebra containing the diagonal 
matrices and hence spanned by some subset of the elementary matrices Eij , and this 
Lie algebra can be easily determined from the graph G. This Lie algebra captures 
Lie point symmetries of the ODE, as in [MTK15] . But the set in the proposition 
is often larger than (the image of) this algebra, and indeed does not correspond to 
any Lie algebra acting on the parameter space. 


Proof. We begin by writing c{A) = [co(^)|ci(A)], where co,Ci are the coefficient 
maps corresponding to the characteristic polynomials of A,Ai^ respectively. Let 
d^co G c>^x(n+m) d^ci G ([^(n-i)x{n+m) differential maps of co,ci, re¬ 

spectively, containing the partial derivatives with respect to the model parameters. 
The differential of c can be written as 


dAC = 


{dACoY ' 
(d^ci)^ 


which shows that X lies in the kernel of d^c if and only if X lies in both the kernel 
of dyico and d^^ci. 

Consider the map cq : C”^" —>■ C" and define the map i/; : GL„ -A- that 

sends g to gAg~^ for some fixed A G C"^". Define the composition (j) := cq o i/;, 
which is of the form 


GL„ 

g 



gAg ^ 


Co 


■> 


C" 

(fll, . . . , (Xri) 


where oi,..., a„ are the coefficients of the characteristic polynomial of gAg~^. The 
characteristic polynomial of A is invariant under conjugation by an element of GL„, 
hence ai,..., a„ are equal to the coefficients of the characteristic polynomial of A. 
This implies that the composition cq o i/i is in fact a constant map sending each 
g G GL„ to the fixed point (ai,...,a„) G C". Therefore, the differential dcj) is 
identically zero; in particular djcf) = 0 for the n x n-identity matrix I. 

By the chain rule, djoj) = (d^co)(d/i/>), so di(j) = 0 implies that the image of d/i/> 
is contained in the kernel of d^co. For X G we have (d/i/;)(X) = [X, A], hence 
the image of d/^ equals [g[„, A]. We conclude that C ker(d^co). 

On the other hand, since cq is a surjective polynomial map from to C", the 
dimension of the kernel of d^co is generically equal to n? —n. The kernel of the Lie 
bracket [•, A\ is precisely the centralizer of A in g[„, which has dimension n according 
to Lemma [3^ below. This shows that the dimension of [g[„, A] equals v? — n, and 
because the kernel is a linear subspace of Og we conclude that ker(d^co) = [gl„, A]. 

The same argument applies to the map ci : —>■ C”“^, showing that 

ker(d. 4 Ci) = {C | Ci G [gl„_]^,i?]} for any A G Og- Gombining these results, 
we see that C lies in the kernel of d^c if and only if C lies in [g[„. A] and Ci lies in 
[g[„_ 2 , Ai]. In other words, there exist X G g[„ and Y G gl„_i such that [X, A\ = C 
and [F, Ai] = Cl. □ 


Lemma 3.4. For sufficiently general A G C”^", the centralizer of A in gl„, denoted 
(A), has dimension n. 
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Proof. Let A G be such that it has n distinct, nonzero eigenvalues. The 

elements of Zqi^{A) must be diagonalized by the same basis that diagonalizes A, 
and such elements are determined by their eigenvalues on this basis. This leaves us 
n degrees of freedom, so the centralizer has dimension n. □ 

3.2. The preimage of the kernel. In the previous section we saw that G has 
the expected dimension if and only if the kernel of the differential of the double 
characteristic polynomial map has dimension n — 1. So far, we have determined 
this kernel, but what can we say about its dimension? 

For given A G 0g, the kernel of cIac equals the image of the commutator map 
X i-A [X,A] restricted to the linear subspace Va Q gin defined by 

Va ■■= {X G I [X, A] G kevidAc)}, 

which is the preimage of ker((i^c) under the commutator map. From the fact that 
[•,A] is a linear map it follows that Va is a linear subspace of g[„. Furthermore, 
any X that commutes with A is contained in Va, since = 0 and [X, A]i = 

0 = [F, Ai] for any V G Z^i^_^{Ai). From Lemma [3.41 we know that the kernel of 
the commutator map has dimension n, which implies that 

(5) dimker(dAc) = n — 1 dimI/A = 2n—1. 

In words, G has the expected dimension if and only if the dimension of I^ is 2n — I. 
Therefore, we will examine the structure of Va for a given graph G. Besides the 
centralizer ot A, Va will always contain the space 'Dn of all n x n diagonal matrices 
with entries in C. Indeed, by computing DA — AD for D G T>n we find 

(6) {DA-AD)ij = {da - djj)aij for i, j = 1,...,n. 

If position {i,j) of A is zero, it follows that position (f, j) of is zero as well. 

This shows that has the correct zero pattern, i.e. [Ll,4l] G ©g- Moreover, 

one can check that 

[D,,A,] = [D,A], 

so the second constraint for being in the kernel of dAC is also satisfied. The space 
of n X n diagonal matrices is again n-dimensional, hence we already have two 
n-dimensional subspaces of Va- However, these two subspaces have a nontrivial 
intersection, as the next lemma shows. 

Lemma 3.5. Zgl^ (H) n Vn = G-In for sufficiently general A G ©g and G strongly 
eonneeted. 

Proof. Suppose that X = diag(Ai,..., A„) G Zgi^{A) n then by definition of 
(H), X satisfies XA = AX. Combining this equality with equation (|6]) shows 
that for Oij 0 this equality implies that XiOij = AjOy and hence \i = \ j . Since 
G is strongly connected, starting from vertex I we can get to any other vertex j 
along some path (I, , ik,j). The corresponding entries oqi, , • ■ •, aji^ are 

nonzero for sufficiently general A, and by the previous observation it follows that 
Ai = Ai^ = ... = Aj. But we can find such a path for any vertex j G [n], so we 
conclude that Ai = ... = A„ and therefore X must be of the form c/„, c G C. □ 

What we have seen so far is that Zgi^{A) + C Va for any G. According to 
Lemma 13.51 this is a subspace of dimension 2n — 1, so the dimension of Va is at 
least 2n — I. Combining this with equation ([5]), we obtain the following corollary. 
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Corollary 3.6. G has the expected dimension if and only if 

+ = {o}. 

We shall now derive several restrictions on the form of elements of the quotient 
space in the above corollary. An important tool will be the following lemma: 


Lemma 3.7. Let G be a graph, not necessarily strongly connected, and let A G Qq 
sufficiently general. Suppose v = {vi,... ,t„) is an eigenvector of A. If Vi 0 and 
there exists a path from i to j in G, then also vj 0. 


Proof. Let v = (ti, . .. ,v„) G C” such that Av = Xv. Partition the indices 1,...,n 
into two sets, [n] = J U J, such that vt = 0 Vi G I and vj ^ 0 Vj G J. Construct 
the I J| X I J| matrix A' by removing the rows and columns of A indexed by elements 
of I. Similarly, let v' be the vector obtained from v by removing its zero entries. 
Then we have A'v' = Xv' and for sufficiently general A' this determines the vector 
v' G up to multiplication by a scalar (recall that the diagonal of A' consists of 
free parameters independent of the other parameters). Since v is obtained from v' 
by adding zero entries at positions indexed by /, also v has been determined up to 
scalar multiplication. However, for Av = Xv to hold, v must satisfy a system of n 
linear equations of the form 

'^OijVj = Xvi, iG[n]. 
j&J 

We know that v must be a solution of the subset of these equations corresponding 
to i G J, since Av' = Xv'. The equations that remain to be satisfied are of the form 

(7) iGl. 

jeJ 

The entries Vj with j G J are already fixed and only depend on the matrix A', so 
for sufficiently general A the Vj are completely independent of the entries Oy with 
i G I. Therefore, if Vj ^ 0 and ^ 0, the nonzero term aijVj cannot be cancelled 
from equation ©• So for V to satisfy Av = Xv, one must have Vi 0 whenever 
there exists j G [n] such that j ^ * is an edge in G and Vj ^0. 

Now suppose Vi 0 and there exists a path {i, si,..., St,j} in G. Then by our 
previous observation, we have 


Vi 0 Vsi 0 ’^St 7^ 0 Vj 0. 


□ 


This lemma implies that the support of v is the union of vertex sets of strongly 
connected components of G. In particular, if G is strongly connected, then v does 
not have any zero entries. 

Proposition 3.8. Let G be strongly connected and A G &g sufficiently general. 
Then any class [A] gVa! (^ 0 i„(^) +TXn) has a representative x = (xij) G Va 
whose first row, first column and diagonal are all zero, i.e. xn = xu = xa = 0 for 
all i G [n]. 

Proof. Let [A] GVa j {Zgi^(A) + T>n). First we show that there exists a represent¬ 
ative X of [A] whose first row and the diagonal are zero, then we use these facts to 
show that also the first column must be zero. 
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We claim that projecting M G Zgi^{A) onto its first row yields a bijection 
between (A) and C". Note that indeed both spaces are n-dimensional. The 
set of diagonalizable matrices is dense in C"^", so a sufficiently general A G Qq 
is diagonalizable. Let A = PDP~^ be the eigendecomposition of A, the columns 
of P forming a basis of eigenvectors. If A is diagonalizable, then MA = AM if 
and only if M = PD'P~^ for some diagonal matrix D'. Since G is strongly con¬ 
nected, Lemma 13.71 implies that P contains no zeros. Hence if M is nonzero, then 
MP = PD' implies that the first row of M has at least one nonzero position. 
Therefore the projection Zgi^{A) -G M" to the first row is injective and as both 
spaces have dimension n, it is also surjective. 

Now choose M G such that its first row equals the first row of X and 


choose a diagonal matrix D G whose diagonal equals the diagonal oi X — M. 
Then M + D G Z^tJA) -h V„ and [X] = [X - (M + D)] G Va / (Z^tJA) + 1?„), 
hence x = X — {M + D) is a representative of [X] satisfying xu = Xu = 0 for all 
i G [n]. 

What remains to be shown, is that for X G Va which has its first row and 
diagonal all zero, also the first column of X must be zero; in fact, in the proof we 
will use only that the first row is zero. Write both X and A as block matrices: 


X = 


0 

0^ 

Xi 

Xi 


and A = 


Oil 

T 1 

ai 

02 

Hi _ 


where Xi,ai and 02 are vectors in C" ^ and Xi,Hi are matrices in i)x(n 1 )^ 
Multiplying these matrices to obtain XA — AX, we see that 

[X, H]i = XiAi — AiXi xiaj’. 


For X to lie in Va there must exist V G g[„_i such that [X,H]i = [X,Hi], so we 
obtain 

xia^ = (Y- Xi)Hi -Ai(V- Xi) = [X - Xi,Hi]. 

We need to show that for sufficiently general A this implies xi = 0, i.e. 

{xiof I xi G C"“^} n [g[„_i,Hi] = {0}. 

Observe that the first space has dimension n — 1, while the dimension of the second 
space equals dimg[„_i — dim(Hi) = (n — 1)^ — (n — 1). This suggests that 
their intersection might indeed be trivial. 

Let B = xiof G [gl„_i. Hi] en let vf ,..., w^_i the row eigenvectors of Hi, where 
z)i,... ,Vn-i G form a basis (since Hi is sufficiently general). We claim that 


(8) vf B G ^ CvJ for i = 1,..., n — 1. 

3¥=i 

Indeed, we can write B = [C, Hi] for some C G 0(„_i, so that, for a fixed i, 
vjB = vjCAi - C = vj (CHi - \C) = vf C(Hi - XJ). 

Now vfC = fo'' some ai,..., a„_i G C. As vf (Hi — Xil) = 0, we find 

n—1 

v'[B = Y^ ajvf (Hi - XJ) = ajiXj - X,)vJ G 0 CvJ, 

i/i i/* 

as claimed. Now decompose aj = where the rows of P are the 

vJ and let J := {j G [n] ] cj ^ 0} be the support of oi on this basis. Since G has 
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an arrow to 1, oi is not identically zero, and hence J 7 ^ 0. Now (jS]) and B = xio^ 
implies that vjxi =0 for j S J. We claim that J =[n], so that xi = 0, as desired. 

To see this, write and note that P~^ is the matrix whose columns 

are the column eigenvectors of Ai. If the graph Gi corresponding to Ai is strongly 
connected, then we know from Lemma IQ that P ^ contains no zeros, and since 
oi is independent of Ai and not identically zero, we find that c has no zeroes. 

In the general case, let Ci,... ,Ci be the strongly connected components of Gi , 
and let u be a column of P~^. By Lemma [Q for each component Ci, the entries 
of u corresponding to the vertices of Ci are either all zero or all nonzero. The 
eigenvector u must be nonzero on at least one component Ci, and if this component 
has an edge to vertex 1 in the original graph G, then afu^O. If Q does not have 
an edge to vertex 1 in G, there must be a path in Gi from Ci to some component Cj 
which does have an edge to I in G, as G is strongly connected. Then Lemma 13.71 
implies that for every vertex k on this path we have Uk 7 ^ 0. In particular, all 
entries of u corresponding to vertices of Cj are nonzero, and since Cj has an edge 
to 1 in G, again we obtain afu^^O. Hence = of P~^ has no zero entries. □ 

This proposition implies that when looking for X G Va / + T>„), it 

suffices to search for X whose first row, first column and diagonal are all zero. 
By definition, Va contains all X G for which [X, H] lies in the kernel of the 
differential c?ac. From Proposition 13.21 we know that this implies that [W, H] G Qq 
and there must exist Y G 0 l„_i such that [H,Hi] = [X,H]i. For X G 0 [„ whose 
first row, first column and diagonal are all zero, we can just take Y = Xi, since 


' 0 

0^ ■ 


Oil 

T 

ai 


Oil 

T 

ai 


' 0 

0^ ■ 


0 

afXi 

0 

Xi 


02 

Hi 


02 

Hi 


0 

Ai 


Xia2 

[Xi,Ai] 


Hence for X of this form we have 

X gVa ^ [X,A] G ©G- 

Combining this observation with Corollary 13.61 and Proposition 13.81 we obtain the 
following corollary. 

Corollary 3.9. G has the expected dimension if and only if, for sufficiently general 
A G ©G) there does not exist X G of the form Xu = Xu = Xu = 0 for all 
i G [n], X 7 ^ 0, such that the commutator [Hi, H] lies in the parameter space ©g. 

Thus, to determine whether a graph has the expected dimension, we need to 
check whether there exists X G satisfying the properties of Corollary 13.91 Con¬ 
sider the following condition on the parameter matrix A = A(G): 

Condition 3.10. There exists an ordered pair (*,j) with i,j G {2,... ,n}, i 7 ^ j, 
such that the support of the j-th row is contained in the support of the Lth row 
of A and the support of the i-th column is contained in the support of the j-th 
column of A. 

For a strongly connected graph G, the matrix A = A{G) satisfies the above 
condition whenever there exist vertices i, j 7 ^ 1 such that for all k G [n] the following 
holds: for any edge k ^ j there is also an edge k ^ i and for any edge i ^ k there 
is also an edge j —>■ k. Also the nonzero entries an and Ujj of A should be taken 
into account, which implies that both aij and aji are nonzero, i.e. i and j form a 
2-cycle in G. 
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Figure 2. Example fS.llI 


Example 3.11. Consider the graph G in Figure [5] and its parameter matrix 

0 ai4 
023 0 

033 0 

043 044 

Observe that the pair (2,3) satisfies Condition 13.101 Let X = E 23 be the matrix 
with a 1 at position (2,3) and zeros elsewhere, then [X, A] has the correct zero 
pattern: 

0 0 0 O' 

0 032 033 — 022 0 

0 0 —032 0 

0 0 0 0 

This shows that E 23 represents a nontrivial element of Va / (Zg[^( 2 l) + £)„) and 
hence G does not have the expected dimension. 

It turns out that Condition l3.10l is a sufficient condition for Va / [Zgi^{A) + Dn) 
to be nontrivial, as we show in the next lemma. Let Eij be the matrix with a 1 at 
position {i,j) and zeros elsewhere. 

Lemma 3.12. Let G be a strongly connected graph such that the pair (i,j) satisfies 
Condition \3.1(]\ with A G ©< 3 . Then the matrix X = Eij G yields a nontrivial 
class [X] gVa / (^ 0 i„ ( 2 I) + Dn) ■ 

Proof. From i,j G {2,..., n} and i j, it follows that X is of the correct form: it 
has its first row, first column and diagonal all zero. According to Corollarv l3.9l we 
only need to show that [.A, A] G ©g- Consider the two terms of the Lie bracket: 
XA has its i-th row equal to the j-th row of A and zeros elsewhere, while AX has 
its j-th column equal to the z-th column of A and zeros elsewhere. Clearly A itself 
must have the correct zero pattern, so from Condition l3.10l we immediately see that 
XA — AX must be in Qq. □ 

When a graph G satisfies Condition l3.101 this lemma allows us to conclude that 
G does not have the expected dimension by only inspecting A(G). However, Con¬ 
dition 13.101 is not a necessary condition for G not to have the expected dimension. 
In the next section we will derive a criterion to decide for a given graph whether 
or not there exists X G satisfying Corollary 13.91 

3.3. A new criterion based on matrix rank. Let X G g[„ have its first row, 
first column and diagonal all zero and write 

X = XijEij. 

(uDei 


[A,A] = 


A{G) = 


Oil 

021 

0 

0 


0 

022 

tt32 

0 
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Define two sets L, R as follows: 

L = I hj e {2,...,n} and i ^ j} 

R = {{kj) I k,l € [n], k ^ I and I ^ k is not an edge of G} 

Note that L corresponds to all positions of X that are outside the first row, first 
column and the diagonal. Also note that R corresponds to all zero positions of 
A(G). 

The constraint [A, A] € 0g from Corollary 13.91 gives rise to a system of linear 
equations in the entries of X with coefficients that are linear A. To see what 
this expression looks like, consider the two terms of the Lie bracket [Eij , A]. The 
product Eij A has its *-th row equal to the j-th row of A and zeros elsewhere, while 
AEij has its j-th column equal to the Tth column of A and zeros elsewhere. Hence 
Eij adds a nonzero term to position (fc, 1) of [X, A] only in the following three cases: 

i = k, j ^ I and I ^ j G G ~>^ji 

i 7 ^ k, j = I and i ^ k G G aki 

i = k and j = I akk — clu 


Define the matrix B{G) G as 


(9) 


B(G)(k,i),(i,j) 


{ -aji 

O^kk — CLll 

0 


a i = k, j l and I ^ j G E 
ii i ^ k, j = I and i ^ k G E 
a i = k and j = I 
otherwise. 


From our previous observations, it follows that B{G) is the coefficient matrix cor¬ 
responding to the system of equations obtained from [X, A] G ©g- 

Let X G be the vector of coefficients a;^, {i,j) G L, then the linear system 
corresponding to [X, A] G ©g is given by 

B{G)x = 0. 

It follows that each solution x G gives rise to a class [A] G Va I (A) -I- Vn) 
and vice versa. Furthermore, a: = 0 if and only if A = 0. Combining these 
observations with the fact that B{G) has a nontrivial kernel if and only if its rank 
is less than |L|, we obtain the following theorem. 

Theorem 11.71 Let G = {V,E) he a graph satisfying Assumvtions Then 

G has an identifiable scaling reparametrization if and only if the matrix B{G) as 
defined in has full column rank. 

If \R\ < \L\, then B{G) certainly has rank smaller than \L\. However, this implies 
that the number of zero positions of A is less than (n — l)(n — 2). Since the number 
of zero positions in A equals — {n + rn), we obtain 

— (n -I- to) < (n — I)(n — 2) 

and hence to > 2(n — I). This is equivalent to Lemma [2.21 which stated that if 
m > 2n — 2 then G does not have the expected dimension. 

Remark 3.13. This condition is related to the existence of a perfect matching in 
the bipartite (undirected) graph H (G) = (L U i?, E) whose edges are defined by 

E = {((i,j), {kj)) I [Eij, A] is nonzero on position {k,l)} . 
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If we define edge weights for the edges in the bipartite graph H (G) by 

w{{i,j),{k,l)) = [Eij,A]ki 

with {i,j) G L and {k,l) £ R, then B{G) is the weighted bi-adjacency matrix 
corresponding to H (G). So if G has an identifiable scaling reparametrization, then 
B{G) has full rank and therefore there exists an L-saturating matching in H |Tut47| . 
In other words, an L-saturating matching in H(G) is a necessary condition for 
G to have the expected dimension. However, this is not a sufficient condition: 
Example lS.Idl shows that although an L-saturating matching in H exists, the matrix 
B{H) does not have rank |L|. 

Example 3.14. Let G be the graph given in Figure [5a1 with its corresponding 
bipartite graph H(G) in Figure l3bl where the thick edges represent an L-saturating 
matching M. Note that |L| = |i?|, so this is actually a perfect matching. The matrix 
B(G) is of the form given in Figure |33 One can check that the last six columns 
of this matrix are linearly dependent, hence B(G) has rank II while |L| = 12. In 
other words, G does not have the expected dimension. 

Remark 3.15. The dimension criterion given by Meshkat and Sullivant allows us 
to check whether a given graph has the expected dimension or not, by computing 
the double characteristic polynomial map c and determining the rank of (Iac in 
sufficiently general A £ ©g- This can be done by substituting random parameter 
values from some large (but finite) set S and evaluating the rank of (Iac in this point. 
The Schwarz-Zippel lemma ensures that by taking S large enough, the probability 
of a false negative can be made arbitrarily small. 

Our new criterion suggests a different randomized algorithm: we construct the 
matrix B[G) as defined in equation ([9]) and compute its generic rank. Again, this 
can be done by substituting random parameter values from a sufficiently large set 
S. Using this algorithm, we avoid calculating the double characteristic polynomial 
map and its differential map. 

4. Properties and constructions 

In the previous section we have seen a new criterion to decide whether a given 
graph has the expected dimension, i.e. whether there exists an identifiable scaling 
reparametrization. We shall now consider the question of how we can extend a given 
graph with the expected dimension by adding vertices and edges, such that the 
resulting graph has the expected dimension as well. Some constructions satisfying 
this property were already presented in |MSI4] , but using Theorem 11.71 we can 
derive stronger results. This section is concluded with some computational results 
for graphs on four and five vertices. 

4.1. Definitions and earlier results. 

Definition 4.1. A graph G is said to have an exchange with % £ {2,..., n} if both 
1 —>■ i and i ^ 1 are edges in G. More generally, a graph has an exchange if there 
exists i £ V such that G has an exchange with i. 

If a graph has an exchange, one of the operations that we can apply is the collapse 
of two vertices: 
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(A) G 

(1,2) (1,3) (2,3) (2,4) (2,5) (3,2) (3,4) (3,5) (4,1) (4,3) (5,1) (5,2) 



(B) H{G) 



(2,3) 

(2,4) 

(2,5) 

(3.2) 

(3,4) 

(3,5) 

(4,2) 

(4,3) 

(4.5) 

(5,2) 

(5,3) 

(5,4) 

(1,2) 

0 

0 

0 

0 

0 

0 

Ol4 

0 

0 

Ol5 

0 

0 

(1,3) 

0 

0 

0 

0 

0 

0 

0 

Ol4 

0 

0 

Ol5 

0 

(2,3) 

022 — O33 

0 

—053 

0 

0 

0 

0 

0 

0 

0 

0 

0 

(2,4) 

0 

022 — 044 

—054 

0 

0 

0 

0 

0 

0 

0 

0 

0 

(2,5) 

0 

—045 

022 — 055 

0 

0 

0 

0 

0 

0 

0 

0 

0 

(3,2) 

0 

0 

0 

033 “ 022 

—042 

0 

0 

0 

0 

0 

0 

0 

(3,4) 

0 

0 

0 

0 

033 ~ 044 

—054 

0 

0 

0 

0 

0 

0 

(3,5) 

0 

0 

0 

0 

—045 

033 — 055 

0 

0 

0 

0 

0 

0 

(4,1) 

0 

0 

0 

0 

0 

0 

-021 

—031 

0 

0 

0 

0 

(4,3) 

042 

0 

0 

0 

0 

0 

0 

044 — 033 

053 

0 

045 

0 

(5,1) 

0 

0 

0 

0 

0 

0 

0 

0 

0 

-021 

-031 

0 

(5,2) 

0 

0 

0 

053 

0 

0 

054 

0 

0 

055 — 022 

0 

—042 


(c) B{G) 


Figure 3. Example 13.141 a graph G with an L-saturating matching, 
yet B{G) does not have full rank. 


Definition 4.2. Given a graph G = {V^ E) that has an exchange with i, the 
collapsed graph G' = {V',E') is the graph in which vertex 1 and i have been 
identified, with V' = V\ {i}. An edge tt —>• u appears in G' if u —>• u appears in G, 
or if u = 1 and tt —>• i is an edge in G, or if it = 1 and i ^ u is an edge in G. 

Figure |4] illustrates an exchange with vertex 2 in G and the collapsed graph 
G'. When collapsing two arbitrary vertices, it is hard to tell whether the resulting 
graph will have the expected dimension or not. In some special cases where G has 
an exchange with i and G' is obtained by collapsing the exchange, i.e. vertices 1 
and i are identified, we can predict whether or not the collapsed graph will have 
the expected dimension. 


Identifying two vertices reduces the number of vertices by one. Conversely, we 
can also increase the number of vertices, for example by subdividing an edge. 
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Figure 4. Left: Exchange (dashed) and line segment (thick) in G. 

Right: The graph G' obtained by collapsing the exchange. 


Definition 4.3. Let G = (V, E) be a graph on n — 1 vertices and let f —>■ j be an 
edge in G. The graph G' = {V', E') obtained by subdividing the edge i ^ j has 
vertex set V = V G {n} and edges E' = {E \ {i ^ j}) U {i ^ n,n ^ j}. 

Another way to increase the number of vertices is by adding a line segment to 
G: choose two vertices k, I of G, add new vertices ni,..., rig, and add the edges of 
the path (fc, ni, n 2 ,..., rig, 1). This is called a line segment, as defined below. 

Definition 4.4. A line segment of length /c > 2 in G is a path (uq, vi,..., Vk) such 
that uq —>■ ui,..., Vk-i —t Vk are edges in G and these are the only edges incident 
to Vl, . . . jUfc-l. 

Note that given an edge f —>■ j in G, subdividing this edge creates a line segment 
of length two, since the new vertex n is incident to i and j but no other vertices. 
Figure m illustrates a line segment of length two in the graph G. 

Definition 4.5. A graph G = (V, E) is minimally strongly connected if it is strongly 
connected and for each edge e G E the graph {V, E \ {e}) is no longer strongly con¬ 
nected. G is said to be inductively strongly connected if there exists some ordering 
of vertices of the n vertices, say 1 ,... ,n, such that for each i G [n] the induced 
subgraph G|i containing vertices 1 ,... is strongly connected. 

Because of Lemma 12.21 we assume that the number of edges is at most 2n — 2. 
We say that a graph is maximal if it contains exactly 2n — 2 edges. 

Observe that if G is inductively strongly connected, then it must have at least 
2n — 2 edges. Hence any inductively strongly connected graph that satisfies the 
bound on the number of edges (m <2n — 2) is maximal. 

Meshkat and Sullivant have already proven some constructions to obtain graph 
with the expected dimension, and derived some properties of such graphs. The 
proofs of these results can be found in |MS14] . and some can also be derived from 
our results in Sections I4.2M.3I 

Proposition 4.6 ( [MS14[ Prop. 5.3]). Let G be a strongly connected maximal graph 
that has the expected dimension. Then G has an exchange. 

ff a graph is not maximal, then an exchange is not a necessary condition for a 
graph to have the expected dimension. For example, any directed cycle has the 
expected dimension. This follows immediately from Corollary 14.191 and the fact 
that a cycle is minimally strongly connected. 

The first construction that we consider is to add an exchange to a given graph. 
The proof in [MS14] considers the characteristic polynomials of the corresponding 
parameter matrices, but this proposition is also an immediate consequence of the 
fact that a cycle has the expected dimension and Proposition 14.121 in Section 14.21 
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Proposition 4.7 (' [MS141 Prop. 5.5]). Let G be a graph on n vertices and construct 
G' from G by adding a new vertex V and an exchange 1 —>■ 1', 1' —>■ 1. Then the 
resulting graph G' with input-output node V has the expected dimension if and only 
if G has the expected dimension. 

The next proposition shows that adding a line segment of length two to a graph 
with the expected dimension again yields a graph with the expected dimension, 
under the condition that G has a ‘chain of cycles ’ containing both vertex 1 and the 
line segment. We do not go into details about this concept, because a we will remove 
this restriction and extend the result to longer line segments in Theorem 14. 141 

Proposition 4.8 f [MS141 Thm. 5.7]). Let G' be a graph that has the expected 
dimension with n — 1 vertices. Let G be a new graph obtained from G' by adding 
a new vertex n and two edges k ^ n and n —>■ I and such that G has a ‘chain of 
cycles’ containing both 1 and n. Then G has the expected dimension. 

Recall that an inductively strongly connected graph can be constructed by adding 
the vertices one by one, while in each step the corresponding subgraph is strongly 
connected. Combining this fact with Proposition 14.81 one can derive the following 
corollary by induction on the number of vertices. 

Corollary 4.9 1 [MS14[ Thm. 5.13]). If G is inductively strongly eonnected with at 
most 2n — 2 edges, then G has the expected dimension. 

Meshkat and Sullivant have also formulated a conjecture: 

Conjecture 4.10 ( [MS141 Conj. 6.6]). Let G be a graph with n vertices, 2n — 2 
edges, and an exchange with i. Let the eollapsed graph G' be the graph where 1 and 
i have been identified. If G' has 2n — 4 edges with an exchange, then G has the 
expected dimension if and only if G' has the expected dimension. 


We have constructed a counterexample, showing that this conjecture certainly 
does not hold in both directions. Consider the graph G in Figure I5al which is 


strongly connected, has an exchange, 
matrix is given by 


A(G) = 


and satisfies m = 2n — 2. Its parameter 


ail 

Ol2 

0 

014 

0 

0 

021 

022 

023 

0 

0 

026 

0 

032 

O33 

0 

0 

0 

0 

0 

0 

044 

045 

0 

0 

0 

O53 

0 

055 

0 

0 

0 

0 

064 

065 

066 


One can check that G has the expected dimension using Mathematica and the 
algorithm based on Theorem 11.71 After collapsing the exchange with 2, we obtain 
the graph G' given in Figure ISbl This graph has parameter matrix 


A{G') = 


Oil 

Ol3 

014 

0 

O16 

O31 

033 

0 

0 

0 

0 

0 

044 

045 

0 

0 

053 

0 

055 

0 

0 

0 

064 

065 

066 


We see that G' is again strongly connected, has an exchange and satisfies m = 2n—4. 
However, G' does not have the expected dimension. This follows from the fact that 
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Figure 5. Counterexample to Conjecture 14.101 


A{G') satisfies Condition 13.101 the support of the column corresponding to vertex 
6 is contained in the column corresponding to vertex 4, and for the rows vice versa. 
So the fact that G has the expected dimension does not imply that G' has the 
expected dimension as well. 

The other direction remains a conjecture, although a partial result follows from 
Proposition 14.201 

4.2. New constructions. In this section we present two new constructions of 
graphs with the expected dimension. These proofs rely on Theorem 11.71 so the 
matrix B = B{G) plays an important role in this section. Recall that the rows of 
B are indexed by pairs (z,j) corresponding to the zero positions of A{G), and the 
columns are indexed by pairs {k,l) with fc, Z C [n], /c ^ Z and k,l ^ 1. These column 
indices correspond to the entries of X S 0 [„ outside the first row, the first column 
and the diagonal. The entries of B are given in 

We will refer to the entry B{G)(^i^ i)^(^i j^ as the entry (or position) indexed by 
{k,l), {i, j), where {k,l) is the row index and {i,j) the column index. We start 
with some basic observations on the structure of B = B{G). For an entry to be 
nonzero, the two pairs representing the row and column index must have at least 
one coordinate in common. Entries indexed by (i, ■),{■, i) or (•,i),(i,-) are zero, 
since neither the rows nor the columns of B have indices {i,i). Furthermore, the 
column indices have no coordinate equal to 1 , hence a nonzero entry in the row 
indexed by (*, 1) must be of the form aji. Similarly, a nonzero entry in the row 
indexed by (1, i) must be of the form aij. Also note that every entry in a given row 
or column of B contains a different parameter. 

Both proofs have the same structure: to show that the matrix B has full rank, 
we group the rows and columns such that we obtain a block matrix. Then we argue 
that each of the diagonal blocks has full rank, and that the non-diagonal blocks 
cannot cancel this term from the determinant of B. When constructing a graph G' 
from G, we choose the blocks such that one of the diagonal blocks is of the form 
B{G'). The other diagonal blocks will be similar to the parameter matrix A, except 
that some rows and columns may be missing. Therefore the following lemma will 
be very useful: 

Lemma 4.11. Let G be a strongly connected graph on n vertices and let A G Qq- 
For k, I € [n] and a G <C define Ak^i^a to be the submatrix of A obtained by replacing 
the diagonal entries an by an — a for all i G [n], and removing row k and column 
1. Then for sufficiently general A the determinant of A^y^a is nonzero. 

Proof. Since G is strongly connected, there exists a path p from k to I, say 

p = (k = Vi,V2, . . . ,Ur-l,fr = O' 
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Let iir+i, ■ ■ ■ ,Vn be the vertices of G that do not appear in p. Rearrange the rows 
and columns of Ak,i,a such that the row indices are ordered as 

'^2 7 '^ 3 ; • ■ • ) 5 • 5 

and the column indices are ordered as 

'I'l, V2 ■ • ■ ) Vr— 1 j Vr ■ 5 '^n ■ 


Then Ak,i,a has diagonal 

(a«2«l J 0,v^V2 ) • ■ • ) O^Vj-Vr-l ) fl^r+ll'r+l ~ 01, . . . , Clv^v„ ~ Of) 

whose entries are nonzero for sufficiently general A. All entries of Ak,i,a correspond 
to different parameters, so taking the diagonal entries large enough will make the 
determinant of A^^i^a nonzero. Having full rank is a Zariski open condition on the 
parameters, so it follows that Ak^i^a has full rank for sufficiently general A. □ 

Now we will derive our first new construction, taking the union of two graphs 
which have exactly one vertex in common. This vertex has to be the input-output 
compartment of at least one of the two graphs. The resulting graph inherits only 
one input-output compartment, such that it still satisfies Assumption 11.21 

Proposition 4.12. Let G he of the form {V' U V'',E' U E") for some graphs 
G' = {V', E'), G" = {V", E"), such that V'DV" = {u}, E'nE" = 0 and 1 e H'. Let 
1 be the input-output compartment of G', while G" has input-output compartment 
V, and let G inherit 1 as its unique input-output compartment. Then G has the 
expected dimension if both G' and G" have the expected dimension. Conversely, if 
G" does not have the expected dimension, then neither does G. 

Proof. Let A = A{G), A' = A[G') and A" = A{G''). The input-output compart¬ 
ment of G" is vertex v, so if we order the vertices of G' such that the last row and 
column of A' correspond to vertex v, then A is of the following form: 


1 _ 

1 

0 


1 _ 

1 

0 


The matrices A', A" intersect at only one position, which is the entry containing 
Ovy. Let B = B{G), B' = B(G') and B" = B{G"). First, we will derive that B 
has full rank whenever both B',B” have full rank, thus proving the first part of 
the proposition. To do so, we partition the matrix B into blocks, such that some 
of these blocks are equal to B', B". Recall that the rows of B are indexed by the 
zero entries of A, and the columns of B are indexed by the pairs (z, j ) with i,j^l 
and i ^ j. We find a block partition of B by partitioning both A and X, since this 
gives us a partition of the row and column indices. Let A', A", A 3 , A 4 be blocks of 
A and let X',X "be blocks of X of the form: 
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A 


V \ {tf} V V" \ {i;} 


V \ {«} V V" \ {u} 



1/'\W 


E"\M 





E' \ {*,} 


l/"\W 


The solid lines indicate the partitioning and the dotted lines indicate the position 
of the row and column indexed by vertex v. We obtain a partition of the rows and 
columns of B by distinguishing between the four blocks of A and X, respectively. 
Note that the blocks of A do not form a partition of the matrix A, because A' and 
A" intersect. However, they intersect at a nonzero position, so this position does 
not appear as a row index of B. Therefore the blocks of A induce a well-defined 
partition of the row indices of B. We obtain the following block matrix: 


X' X" X 3 Xi 


A' 

B' 

0 

Cl 

C 2 

A" 

0 

B” 

0 

0 

A 3 

Q 

0 

Di 

0 

A 4 

G 4 

0 

0 

D 2 


The zero positions of the matrices A', A" are exactly the row indices of B',B", 
respectively. Furthermore, the positions of X',X" which are outside the first row, 
first column and diagonal of X yield exactly the column indices of B', B", respect¬ 
ively. Each edge of G' also appears in G, so by definition of B' the block 
indexed by A' and X' is indeed the matrix B' corresponding to G', and similarly, 
the block indexed by A" and X" is exactly the matrix B" corresponding to G". 
Now consider the block indexed by A' and X"] if (fc, 1) is a zero position of A' and 
{i,j) is a nonzero position of X", then these two pairs only have a coordinate in 
common when j = l= v 01 i = k = v. Then the corresponding entry of B is of the 
form Ofci or —aji, respectively, with i,j G V" \ {u} and k,l G V. However, there 
are no such edges j —Z or fc —>■ z in G, because these would correspond to edges 
between G' and G" not incident to v. Hence the corresponding entry of B is zero, 
and therefore the entire block indexed by A!^X" is zero. A similar analysis shows 
that each of the blocks of B denoted with a zero indeed is a zero matrix. 

Next, we analyze the blocks Gi, G 2 , G 3 and G 4 . For Gi to have a nonzero entry, 
we need a position (fc,Z) of A! and a position (i,j) of A 3 to have a coordinate in 
common. From the way that A and X have been partitioned, we see that the only 
option is k = i. This gives the entry —Oji, where j G V" \ {u} and I G V. The only 
parameters of this form are ajy, with w —^ j an edge in G”. Therefore, the nonzero 
entries of Gi are indexed by (i,j) such that v ^ j is an edge in G, and the 
corresponding entry is of the form —ajy. Similarly, the nonzero entries of C 2 are 
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indexed by {v, j) such that i ^ is an edge in G", and the corresponding 
entry is of the form avi- For the block G 3 the same analysis shows that all nonzero 
entries are of the form — a„i, while the nonzero entries of G 4 are of the form ai„, 
i £ V" \ {n}. The exact form of these blocks is not important for our further 
analysis, all we need is that the only nonzero entries are either ± 0 ^,^ or ± 0 ^ with 
^ e y" \ {n}. 

Finally, consider the block Z?i, which is indexed by The block has 

size {\V'\ — 1)(|F"| — 1) and consists entirely of zeros, so the number of rows of 
Di equals (|F'| — l)(|y"| — 1). The columns of Di are indexed by the block X3 
of size — !)■ Since the first row of X must be zero, X 3 gives only {\V'\ — 

l)(|y"| — 1) column indices. We conclude that Di is square, hence we can calculate 
its determinant to see whether it has full rank. 

Observe that A^ yields pairs {i,j) with i &V' \ {i;} and j £ V" \ {i;}, while the 
pairs corresponding to X 3 are of the form (i, j) with i G V'\ {1} and j £ V" \ {u}. 
Order both the row and column indexes by their second coordinate, then we obtain 
diagonal blocks of Di of the form for all a gV"\ {i;}, with defined 

in Lemma [4. Ill It follows that naey"\{?)} det(AJ, q,) is a nonzero term appearing 
in the determinant of Di. The nonzero entries of Di outside the blocks 1 „ are 
parameters from A" ^ because the corresponding indices can only have their first 
coordinate in common. These parameters are therefore determined by the second 
coordinates of their indices, which are from V" \ {n}. Since the determinants of 
A'y I a contain only parameters from A' ^ the product of those determinants cannot 
be canceled out when calculating the determinant of Di. 

A very similar argument (ordering rows and columns by their first coordinate) holds 
for the determinant of D 2 , so we conclude that the determinants of Di and D 2 are 
generically nonzero. 

Now suppose that both B' and B" have full rank. These matrices do not need 
to be square, since the number of rows may be larger than the number of columns. 
However, being full rank means that there exists a subset of the rows such that 
the corresponding matrix is square and invertible. Let B', B" be such square sub¬ 
matrices with nonzero determinant, and let B be the corresponding square sub¬ 
matrix of B. Then from the structure of B, we see that the determinant det(il) 
contains a term 

det(i?') det(i?") det(Ili) det(Zl 2 ). 

Moreover, the determinant of B contains a factor det(il"), because all other entries 
in the corresponding rows and columns are zero. The nonzero off-diagonal blocks 
only contain entries of the form ±ayj and ia^i, with j £ V" \ {i;}, but these entries 
do not appear in Di,D 2 or B'. Therefore the term above can never vanish, i.e. B 
has nonzero determinant. 

The second part of the proposition follows directly from the fact that the de¬ 
terminant of B contains a factor det(i?"): if det(i3) is nonzero, then det(i3") must 
also be nonzero. □ 

Note that the proof does not rely on any assumption on the shared vertex v, so 
it may also be that G' and G" have their input-output compartment in common. 

Remark 4.13. Let G,G',G" be as in Proposition I4.12| We have just seen that 
if both G', G" have the expected dimension, then so does G. Conversely, if G' 
does not have the expected dimension, this does not necessarily imply that G does 
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Figure 6. G' U G” , where the edges of G" are dashed. 


not have the expected dimension. For example, the graph in Figure |6] has the 
expected dimension, while its subgraph G' does not. However, if V' fl V" = {!}, 
then applying the proposition twice shows that G has the expected dimension if 
and only if both G' and G" have the expected dimension. 

For our next result, recall Proposition l4.8l of the previous section; it states that if 
G' is a graph on n — 1 vertices which has the expected dimension and we construct 
G from G' by adding a new vertex n and two edges k ^ n and n —>■ I such that G 
has a chain of cycles containing both 1 and n, then G has the expected dimension 
as well. Using Theorem 11.71 we present a stronger version of this theorem: 

Proposition 4.14. Let G = (V^E) on n — 1 vertices he a graph with the expected 
dimension. Construct G' from G by adding new vertices ni,... ,ns and edges k —>■ 
ni, Ug —)• Z and Ui rii+i for i = 1,..., s — 1, where k,l € V are vertices of G. 
Then G' has the expected dimension. 

Proof. Let A = A{G) and Ad = A{G') and observe that H is a submatrix of A', since 
G is a subgraph of G'. Consider the coefficient matrices B = B{G) and B' = B[G') 
corresponding to G and G', respectively. We will use the fact that B has full rank 
to show that also B' has full rank. Let B be obtained from B by deleting a subset 
of the rows, such that B has nonzero determinant. Let B' be obtained from B' by 
deleting the same subset of the rows, and additionally deleting the rows indexed by 
(l,np) for all p G [s]. We rearrange the rows and columns of B' by distinguishing 
between indices of the form (z,j) with i,j G V and indices of the form (rip,r) or 
(r. Up) with p G [s] and r G U U {ug \ q < p}. We claim that this brings B' into the 
form given below (Figure [7]), where the empty blocks are all zero, and the blocks 
containing a parameter contain both zero entries and entries of the form atj. 

To show that B' is indeed of this form, we consider the different blocks one by 
one, starting with the upper left block. From the fact that H is a submatrix of A', 
it follows that the submatrix of B' corresponding to the rows and columns indexed 
by pairs of the form (i,j) with i, j G U is exactly the matrix B. 

The remaining positions in the rows indexed by (*, j) with i, j G V have columns 
indexed by (rip, r) or (r, Up) with p G [s] and r G U U {uq \ q < p}. By definition of 
B' Q, the only nonzero entries occur when r = j or r = i, respectively, and this 
gives the entry or aju^. However, the only entries of A' of this form are 
and ain^. A similar analysis of the positions in the columns indexed by (z,j) with 
G V but outside B, shows that the only nonzero entries are of the form a„ife 
and ain^ as well. 

Next, consider the blocks Dp, where p G [s]. The positions of Dp are indexed 
by (i,np), {j,np), hence they have at least their second coordinate in common. If 
i ^ j, the corresponding entry is of the form Oij, and if r = j we obtain an — a„pnp- 
Because of our ordering of rows and columns, we have i,j € V U{nq \ q < p}, i ^ 1, 
and by definition of B' = B{G') also j ^ 1. Therefore the block Dp equals the 
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V (->"1) (’*!.■) (■,’^2) (H2.') (-.113) ("3,-) . (-.n.-l) (n,,-) 


B 













Di 


^712771 








^lUs 

^nik 


D[ 













D2 


^773722 









^n2ni 


D'2 



















^lUs 





^713772 


D's 


























Ds-i 


^77s77s_i 

^lUs 










DU 



^lUs 


^lUs 




^lUs 



^lUs 

Ds 













_ 


Figure 7. Block form of B' 


submatrix of A' obtained by deleting rows and columns indexed by 1 or Ug with 
q > p. Furthermore, the diagonal entries an of A have been replaced by entries 
da — <inpnp- Thus, the block Dp is a block lower diagonal square matrix of the form 




^77iA; 

^77i77i ^77p77p 


^772 771 


^77p_i77p_2 <^77p_i77p_i ~ *^77p77p 


with determinant 


det(Dp) 


det(Ai,i,„p) if p = 1 

det(Al^l^72p) Oj —1 GripTip) if P ^ 2. 


Analogously, the row and column indices of blocks D'^ all have their first co¬ 
ordinate in common. The positions are indexed by {np,i), {up, j), which implies 
that the entries are of the form —aji or —{an — Note that we obtain —aji 

instead of o^, so if we apply a similar analysis as we did for Dp, we obtain some 
submatrix of —{A')'^. Furthermore, there is no row index of the form (np,np-i), 
since the edge Up-i —)■ Up occurs in G'. Hence —{Dp)'^ equals the submatrix of A' 
obtained by deleting rows and columns indexed by Ug with q > p, and also deleting 
column np-i and row 1. Again, the diagonal entries an of A! have been replaced by 
entries an — o-upUp- If we separate column k from the rest of the columns indexed 
by {np,i) with i £ V, we see that the block Dp is a block upper diagonal square 
matrix of the form 
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-{d;) 


1 


^UiTli ^UpTlp 


^nik 



^n2n\ 



^np_2np_2 ~ ^ripUp 



^np_inp_2 


and its determinant satisfies 

{ det(Ai,fe,„J ifp=l 

det(Ai,fe,„Ja„ifc ifp = 2 

+ if P ^ 3. 

Finally, consider the entries which do not lie in any of the blocks Dp or Dp or 
in the rows and columns denoted by V in B'. These rows are indexed by (rip, •) or 
(•,np), while the columns are indexed by [uq, •) or {■,nq) with q ^ p. If two such 
pairs have an entry in common, then the two remaining entries determine the value 
of the corresponding entry of B'. Equation (flUl) gives an overview of all possible 
pairs of indices having a coordinate in common, the corresponding entry of B' and 
the conditions for this entry to be nonzero. 


{np,i), {nq,i) 


^UpUq 


7^0iffp = q + l 

(i,np), {i,nq) 


^UqUp 


7^0iffp = q — 1 

np,nq), {i,nq) 


^ripi 

a p > q 

= 0 

nq,np), {nq,i) 


O^irip 

ii p > q 

0 iA p = s,i = 1 

np,i), inp,nq) 


O^riqi 

A p < q 

= 0 

B ^p) 7 (^<7 5 


^iriq 

A p < q 

^ 0 iA q = s,i = 1 


We conclude that B' is indeed as claimed. Since the diagonal blocks of B' have 
full rank, its determinant contains a term 

S 

mi = det(B) det(T>p) det(IIp), 

p=i 

and we will show that this term cannot be cancelled out. 

Recall that it is enough to show that det(i3') is nonzero for A in some Zariski 
open subset of 0, since we only consider sufficiently general A. Therefore, we can 
simplify things by setting ain^ = 0 and an^n,-i = 0. Note that these parameters do 
not appear in mi, because none of the Dp, Dp contain rows or columns of A' that 
are indexed by rig, hence setting these parameters^to zero does not affect mi. 

Suppose m 2 is a term of the determinant of B' which cancels out mi, then it 
must have at least one entry from outside the diagonal blocks of B'. From our 
previous observations, we know that these entries are of the form 

^ni fe 1 ^n 2 ni 5 • ■ • 5 ^ns_ irig —2 ’ 

since = aju^ =0. As we have seen, these parameters also appear in 

the determinants of Dp. More specific, an„_ins -2 occurs only in det(Z)'), while 
ans- 2 ns -3 occurs in det(I?') and det(Z?'_]^), continuing up to Umk which divides all 
of det(I?j),... ,det{Dg). Using this observation, we will argue that m 2 can never 
yield the term mi. 
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5 


A 


2 -► 3 


1 


t 


i 


T 


1 - 4 


Figure 8. Graph G'; obtained from G by adding a line segment (dashed). 


Consider the block Z?'. From the fact that we set to zero, it follows that 

the rows of B' indexed by (rig, •) are all zero outside ZZ'. Therefore, any term in 
the determinant of B' contains det(I?'). The parameter appears exactly 

once in mi, namely in det(I?'), so m 2 cannot contain another factor an^_ins- 2 - 
Now consider the block D'^_i and observe that the only nonzero entries in the 
rows of B' indexed by (ns_i, •) are of the form From the observation 

that m 2 cannot contain these entries, it follows that m 2 contains a factor det(iZ'_j^). 
The parameter an^_ 2 n ,-3 appears exactly twice in mi, namely in det(I?') and in 
det(Z5'_j^), hence m 2 cannot contain a third factor a„^_ 2 ns_ 3 - One can repeat this 
argument, showing step by step dat m 2 cannot contain any entries which are outside 
the diagonal blocks. However, this implies that mi cannot be cancelled out by m 2 
in the determinant of B'. This shows that det(H') is nonzero for sufficiently general 
H', thus proving that G' has the expected dimension. □ 


The proof of Proposition 14.141 does not need any restrictions on k,l € V, so we 
can add a cycle by choosing k = 1. 

The converse of Proposition 14.141 does not hold; if G does not have the expected 
dimension, then G' might still have the expected dimension. For example, consider 
the graph G' in Figure [8] which is obtained from the graph G by adding vertex 5 
and edges 3^-5 and 5^-2. The graphs G,G' have parameter matrices A,A\ 
respectively: 


Oil 

ai 2 

0 

014 

021 

^22 

0 

0 

0 

032 

033 

034 

0 

0 

043 

044 


Oil 

012 

0 

014 

0 

021 

022 

0 

0 

025 

0 

032 

033 

034 

0 

0 

0 

043 

044 

0 

0 

0 

053 

0 

055 


From the structure of A we see that G does not have the expected dimension, since 
the pair (3,4) satisfies Condition 13.101 On the other hand, one can check that G' 
does have the expected dimension using Theorem 11.71 


We conclude this subsection with a conjecture. 

Conjecture 4.15. Let G be a graph on n — 1 vertices and let k ^ I be an edge in 
G. Construct the graph G' on n vertices subdividing the edge k ^ I, i.e. by adding 
vertex n to G and replaeing the edge k ^ I by two edges k ^ n,n ^ 1. Then if G 
has the expected dimension, G' has the expected dimension as well. 

This conjecture has been verified for all graphs G on four and five vertices using 
Mathematica, as well as for larger random graphs. Unfortunately, the techniques 
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1 - 4 1 -4 1 - 4 1 - 4 

G S\ S 2 

Figure 9. A graph G with three different ear decompositions £\, 82 ,Si. 


we used to prove the previous propositions cannot be applied here so easily, because 
the matrix A{G) is not a submatrix of A{G'). 

4.3. Ear decompositions. This section describes how to construct graphs with 
the expected dimension using Propostion l4.14| Starting from a cycle, which has the 
expected dimension, we can add line segments to obtain new graphs, for example 
all minimally strongly connected graphs. This gives rise to a procedure to obtain 
a graph which has the expected dimension from a graph which does not have 
the expected dimension. An important concept that we shall be using is the ear 
decomposition of a directed graph, as defined in [BJG07] : 

Definition 4.16. Given a directed graph G = {V, E), let £ = {Pq, Pi, - ■ ■, Pt} be a 
sequence of cycles and paths in G, f > 0, and define Gi = {Vi,Ei) := PoUPiU.. .UP^. 
Then £ is an ear decomposition of G if Pq is a cycle, Gt = G, and each Pi is a path 
(uojUi, .. .,Vk), k>l, satisfying 

(1) vo,Vk € Vi-i (not necessarily distinct), 

(2) VO < z < j < fc : Vi e V \ Vi-i and Vi ^ Vj, 

(3) y0<i<j<k: Vi ^ Vj € E \ Pi_i. 

The Pi are called the ears of £, and if fc = 1 the ear Pi = (vg, vi) is called a trivial 
ear. 

Note that the graphs Go,..., G* are strongly connected, hence if G has an ear 
decomposition then it must be strongly connected. The converse also holds: if 
a graph is strongly connected, then it must have an ear decomposition |BJG071 
Thm. 5.3.2]. This can be seen from the fact that in a strongly connected graph 
every node lies on a cycle. 

A graph may have many different ear decompositions, as shown in Figure |9l 
Each of these decompositions has the same number of ears, namely m — n + 1 
|B.TGn71 Gor. 5.3.3]. 

Our purpose is to construct graphs with the expected dimension, using the results 
of the previous section. Therefore, we define a specific kind of ear decomposition: 

Definition 4.17. We say that a graph G has a nontrivial ear decomposition if it 
has an ear decomposition without trivial ears, and such that the initial cycle Pq 
contains vertex 1. 

Gonsider the graph G given in Figure [S] and the three ear decompositions 
£i,£ 2 ,£ 3 - In each £i, let Po,Pi and P 2 be the normal, dashed and dotted ears, 
respectively. Then the initial cycle Pq contains vertex 1 in each of the three decom¬ 
positions. However, £i is the only ear decomposition without trivial ears. In other 
words, £i is a nontrivial ear decomposition of G, but £2 and £g are not. 
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Theorem II.8L Let G be a graph that has a nontrivial ear decomposition, then G 
has the expected dimension. 

Proof. Let G have a nontrivial ear decomposition £ = {Pq, ..., Pt}. A nontrivial 
ear decomposition consists of nontrivial ears, and a nontrivial ear corresponds to 
a line segment (see Section 14.11) of length at least two. Since a cycle is known to 
have the expected dimension, it follows that Gq has the expected dimension, and 
we can apply Proposition 14. 141 1 times to conclude that Gi, G 2 ,..., G* all have the 
expected dimension. In other words, if a graph has a nontrivial ear decomposition, 
then it certainly has the expected dimension. □ 

Conversely, if a graph has the expected dimension, it does not need to have a 
nontrivial ear decomposition. For example, the graph in Figure |6] has no nontrivial 
ear decomposition, yet it does have the expected dimension. 

Proposition 4.18. A graph G is minimally strongly connected if and only if all 
its ear decompositions have no trivial ears. 

Proof. Suppose that G has an ear decomposition with a trivial ear. Then the 
graph obtained from G by deleting the edge of this trivial ear is strongly connected, 
because it has an ear decomposition. Hence G is not minimally strongly connected. 

Conversely, suppose that G is not minimally strongly connected, then it has an 
edge e that can be removed, such that the resulting graph G' remains strongly 
connected. Then G' has an ear decomposition, and adding the trivial ear that 
contains the edge e results in an ear decomposition of G that has a trivial ear. □ 

Because of the above proposition and the fact that a strongly connected graph 
has at least one ear decomposition, we can find for any minimally strongly connected 
graph a nontrivial ear decomposition. Combining this with Theorem 11.81 leads us 
to the following corollary. 

Corollary 4.19. If G is minimally strongly connected, then it has the expected 
dimension. 

The converse does not hold, because for G to have the expected dimension it is 
enough to have only one nontrivial ear decomposition. For example, the graph G 
in Figure [5] has a nontrivial ear decomposition, hence the expected dimension, but 
it is not minimally strongly connected. 

Proposition 4.20. Let G he a graph that contains a 2-cycle {i —>■ j,j —>■ z}, and let 
G' be the graph where the vertiees i and j have been identified. If G has a nontrivial 
ear decomposition, then so does G'. 

Proof. Let £ he a nontrivial ear decomposition of G. Any 2-cycle G in G must 
appear as an ear in £, because if there is an ear that contains only one of the two 
edges in G, then the other edge can only appear as a trivial ear. After identifying 
vertices i and j to obtain G', the 2-cycle no longer exists. A nontrivial ear decom¬ 
position of G' is obtained from £ by removing the ear that is equal to the 2-cycle 
and replacing vertex j by vertex i in the remaining ears. Note that the number of 
edges in these ears does not change, so they remain nontrivial. □ 

A special case of this theorem occurs when G has an exchange, and G' is obtained 
by collapsing the exchange. This shows that Conjecture 14.101 holds when G has a 
nontrivial ear decomposition. 
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Theorem 11.81 gives rise to two options to turn a graph that does not have the 
expected dimension into a graph that does have the expect dimension. If G does 
not have the expected dimension, then every ear decomposition of G contains a 
trivial ear. In order to transform the graph into one that has a nontrivial ear 
decomposition, start with an arbitrary ear decomposition and either remove the 
trivial ears, or subdivide the corresponding edges, such that they are no longer 
trivial. To keep the number of changes as small as possible, one should start with 
an ear decomposition with the smallest possible number of trivial ears. 

4.4. Relaixing model constraints. In the introduction of this paper, we set sev¬ 
eral assumptions on the models to be considered, thus reducing our research to a 
rather small class of models. However, the results for ear decompositions, presented 
in the previous section, still hold when relaxing Assumr)tion ll.2l and Assumntion ll.dl 
If a model has multiple inputs or outputs, besides compartment 1, this will only give 
more information and hence may even make the model identifiable. Furthermore, 
if not all compartments have a leak, this means that there are less parameters to be 
recovered; the remaining parameters might even be identifiable. So when relaxing 
our assumptions as described. Theorem 11.81 still holds. 

Moreover, the procedure described to turn a graph that does not have the ex¬ 
pected dimension into a graph that does have the expect dimension remains valid 
when relaxing our assumptions. It is important to realize though, that with more 
information available (due to multiple inputs or outputs) or less information re¬ 
quired (due to absent leaks), following this procedure could alter the graph much 
more than necessary. 

4.5. Computational results. In the previous section, we have seen two classes 
of graphs with the expected dimension: the graphs which have a nontrivial ear 
decomposition, and those which are minimally strongly connected. Moreover, from 
Corollarv l4.9l we know that all inductively strongly connected graphs (with at most 
2n — 2 vertices) have the expected dimension. Using the computer algebra package 
Mathematica, the cardinalities of these classes have been calculated for n = 3,4, 5. 

Let G{n) denote the class of strongly connected graphs on n vertices with at most 
2n — 2 edges, up to the following equivalence. Since vertex 1 has a special role, 
graphs are considered to be equivalent if they can be obtained from one another by 
permuting vertices 2, 3,..., n. Now we define the following subclasses of G{n): 


Q*{n) = {G G Gin) | G has the expected dimension} 

Gc{n) = {G G Gin) I G has a nontrivial ear decomposition} 
Gisc{n) = {G G G{n) I G is inductively strongly connected} 
f/MSc(’T') = {G G G{n) I G is minimally strongly connected} 


From Section 14.31 we know that 

Gusein) C Gc(n) C G*{n) C Gin). 

The class of inductively strongly connected graphs Giscin) is also a subset of Gdn), 
but f?MSc(*^) is not contained in 0isc(^) or vice versa. 
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n 

\G{n)\ 

ienn)i 

\Gc{n)\ 

|f?isc(R)| 

| 0 msc(r) 

3 

6 

5 

5 

4 

3 

4 

71 

43 

39 

26 

12 

5 

1472 

628 

450 

267 

57 


Table 1. Computational results 


The cardinalities of these classes (for n = 3,4,5) are presented in Table [TJ It 
shows that the class of graphs with a nontrivial ear decomposition is a large subset 
of G*{n), but the ratio \Qc{n)\/\G*{n)\ decreases as n grows. 

5. Conclusions and future work 

Inspired by the work of Meshkat and Sullivant, we have derived a new criterion 
to determine whether a graph has an identifiable scaling reparametrization. This 
criterion allowed us to derive two new constructions to obtain graphs for which an 
identifiable scaling reparametrization exists, extending the results of |MS14| . This 
led us to the concept of ear decompositions of graphs and a procedure to transform 
any graph into one that has an identifiable scaling reparametrization. 

The results presented in this paper are based on a couple of assumptions, that 
restrict the class of graphs considered. One of our main results. Theorem 11.81 and 
the application of this theorem to obtain graphs that admit an identifiable scaling 
reparametrization, both remain valid under relaxed assumptions. However, with 
more information available (additional inputs or outputs) or less parameters to 
identify (missing leaks), following this procedure could alter the graph much more 
than necessary. Therefore, it would be very interesting to see if our approach can 
be generalized to a less restricted class of graphs. Some work in this direction 
appeared recently |MSE15| . showing how to add inputs, add outputs, or remove 
leaks, in order to obtain an identifiable model. 

A nice starting point for further research would be Conjecture 14.151 Then, the 
next step is to consider more general models; for example, what can we say about 
the case where input and output do not take place in the same compartment? 
Suppose the input takes place in compartment 1, while the output takes place in 
compartment 2. This affects the input-output equation and hence also the double 
characteristic polynomial map. Similar to Theorem 12.11 the input-output equation 
becomes 

det(i9/„ — A)y = det(i9/„_i — A 2 )m, 

where A 2 denotes the matrix obtained from A by removing its hrst row and its 
second column. This equation gives rise to a coordinate map c', analogous to the 
definition of the double characteristic polynomial map c. It would be interesting to 
apply a similar analysis to the coordinate map c' as we did for c. 
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